Pitfalls of assessing extracted hierarchies for multi-class classification

نویسندگان

چکیده

Using hierarchies of classes is one the standard methods to solve multi-class classification problems. In literature, selecting right hierarchy considered play a key role in improving performance. Although different have been proposed, there still lack understanding what makes good and method extract perform better or worse. To this effect, we analyze compare some most popular approaches extracting hierarchies. We identify common pitfalls that may lead practitioners make misleading conclusions about their methods. address these problems, demonstrate using random an appropriate benchmark assess how hierarchy’s quality affects particular, show can become irrelevant depending on experimental setup: when powerful enough classifiers, final performance not affected by hierarchy. also comparing effect against non-hierarchical might incorrectly indicate superiority. Our results confirm datasets with high number generally present complex structures relate each other. datasets, dramatically improve

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-label classification and extracting predicted class hierarchies

This paper investigates hierarchy extraction from results of multi-label classification (MC). MC deals with instances labeled by multiple classes rather than just one, and the classes are often hierarchically organized. Usually multi-label classifiers rely on a predefined class hierarchy. A much less investigated approach is to suppose that the hierarchy is unknown and to infer it automatically...

متن کامل

Multi-class Boosting with Class Hierarchies

We propose AdaBoost.BHC, a novel multi-class boosting algorithm. AdaBoost.BHC solves a C class problem by using C− 1 binary classifiers defined by a hierarchy that is learnt on the classes based on their closeness to one another. It then applies AdaBoost to each binary classifier. The proposed algorithm is empirically evaluated with other multi-class AdaBoost algorithms using a variety of datas...

متن کامل

Exploiting Associations between Class Labels in Multi-label Classification

Multi-label classification has many applications in the text categorization, biology and medical diagnosis, in which multiple class labels can be assigned to each training instance simultaneously. As it is often the case that there are relationships between the labels, extracting the existing relationships between the labels and taking advantage of them during the training or prediction phases ...

متن کامل

Multi Class Classification

Generalization error of classifier can be reduced by larger margin of separating hyperplane. The proposed classification algorithm implements margin in classical perceptron algorithm, to reduce generalized errors by maximizing margin of separating hyperplane. Algorithm uses the same updation rule with the perceptron, to converge in a finite number of updates to solutions, possessing any desirab...

متن کامل

Extreme Multi Class Classification

We consider the multi class classification problem under the setting where the number of labels is very large and hence it is very desirable to efficiently achieve train and test running times which are logarithmic in the label complexity. Additionally the labels are feature dependent in our setting. We propose a reduction of this problem to a set of binary regression problems organized in a tr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Pattern Recognition

سال: 2023

ISSN: ['1873-5142', '0031-3203']

DOI: https://doi.org/10.1016/j.patcog.2022.109225